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1 Introduction 

This is an entirely expository piece: the main resuhs discussed are very well- 
known and the approach we take is not really new, although the presentation 
may be somewhat different to what is in the literature. The author's main 
motivation for writing this piece comes from a feeling that the ideas deserve to 
be more widely known. 

Let g be a Lie algebra over R or C. A vector subspace / C g is an ideal 
if [/, g] C /. The Lie algebra is called simple if it is not abelian and contains 
no proper ideals. A famous result of Cartan asserts that any simple complex 
Lie algebra has a compact real form (that is to say, the complex Lie algebra 
is the complexification of the Lie algebra of a compact group). This result 
underpins the theory of real Lie algebras, their maximal compact subgroups 
and the classification of symmetric spaces. In the standard approach, Cartan's 
result emerges after a good deal of theory: the Theorems of Engel and Lie, 
Cartan's criterion involving the nondegeneracy of the Killing form, root systems 
etc. On the other hand if one assumes this result known-by some means-then 
one can immediately read off much of the standard structure theory of complex 
Lie groups and their representations. Everything is reduced to the compact case 
(Weyl's "unitarian trick"), and one can proceed directly to develop the detailed 
theory of root systems etc. 

In [3], Cartan wrote 

J'ai trouve effectivement une telle forme pour chacun des types de groupes 
simples. M. H. Weyl a demontre ensuite V existence de cette forme par une 
raisonnement general s 'appliquant a tous les cas a fois. On peut se deman- 
der si les calculs qui I'ont conduit a ce resultat ne pourraient pas encore se 
simplifier, ou plutot si Von ne pourrait pas, par une raissonnement a priori, 



demontrer ce theoreme; une telle demonstration permettrait de simplifier no- 
tablement I'exposition de la theorie des groupes simples. Je ne suis a cet egard 
arrive a aucun resultat; j'indique simplement I'idee qui m'a guide dans mes 
recherches infructueuses. 

The direct approach that Cartan outhned (in which he assumed known the 
nondegeneracy of the Kilhng form) was developed by Helgason (see page 196 
in [4]), and a complete proof was accomplished by Richardson in [M]. In this 
article we revisit these ideas and present an almost entirely geometric proof of 
the result. This is essentially along the same lines as Richardson's, so it might 
be asked what we can add to the story. One point is that, guided by modern 
developments in Geometric Invariant Theory and its relations with differential 
geometry, we can nowadays fit this into a much more general context and hence 
present the proofs in a (perhaps) simpler way. Another is that we are able to 
remove more of the algebraic theory; in particular, the nondegeneracy of the 
Killing form. We show that the results can be deduced from a general principle 
in Riemannian geometry (Theorem 4). The arguments apply directly to real 
Lie groups, and in our exposition we will work mainly in that setting. In the 
real case the crucial concept is the following. Suppose is a Euclidean vector 
space. Then there is a transposition map A i— > on the Lie algebra End V. 
We say a subalgebra g C End V is symmetric with respect to the Euclidean 
structure if it is preserved by the transposition map. 

Theorem 1 Let g be a simple real Lie algebra. Then there is a Euclidean 
vector space V, a Lie algebra embedding g C End(T^), and a Lie group G C 
SLiy) with Lie algebra g, such that g is symmetric with respect to the Euclidean 
structure. Moreover, any compact subgroup of G is conjugate in G to a subgroup 
ofGnSO{V). 

We explain in (5.1) below how to deduce the existence of the compact real 
form, in the complex case. Theorem 1 also leads immediately to the standard 
results about real Lie algebras and symmetric spaces, as we will discuss further 
in (5.1). 

The author thanks Professors Martin Bridson, Frances Kirwan, Zhou Zhang 
and Xuhua He for comments on the earlier version of this article. 

2 More general setting 

Consider any representation 

p : SL{V) -> SL{W), 

where V, W are finite-dimensional real vector spaces. Let w he a nonzero vector 
in W and let Gw be the identity component of the stabiliser of w in SL{V). 
Then we have 

Theorem 2 IfVis an ireducible representation of Gw then there is a Euclidean 
metric on V such that the Lie algebra of Gw is symmetric with respect to the 
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Euclidean structure, and any compact subgroup of is conjugate in Gw to a 
subgroup of Gw H SO{V). 

Now we will show that Theorem 2 implies Theorem 1 . Given a simple real Lie 
algebra g, consider the action of iS'L(g) on the vector space W of skew symmetric 
bilinear maps from g x g to g. The Lie bracket of g is a point w in W. The 
group Gyj is the identity component of the group of Lie algebra automorphisms 
of g, and the Lie algebra of Gw is the algebra Der(g) of derivations of g, that 
is, linear maps 6 : g ^ g with 

5[x,y] = [5x,y] + [x,5y]. 

The adjoint action gives a Lie algebra homomorphism 

ad : g — > Der(g). 

The kernel of ad is an ideal in g. This is not the whole of g (since g is not 
abelian) so it must be the zero ideal (since g is simple). Hence ad is injective. If 
U is a. vector subspace of g preserved by Gw then any derivation 5 must map U 
to U. In particular ad^ maps U to U for any ^ in g, so [g, f7] C C/ and U is an 
ideal. Since g is simple we see that there can be no proper subspace preserved 
by Gw and the restriction of the representation is irreducible. By Theorem 2 
there is a Euclidean metric on g such that Der(g) is preserved by transposition. 
Now we want to see that in fact Der(g) = g. For a G Der(g) and ^ £ g we have 

[ad^,a] = ada{^), 

so g is an ideal in Der(g). Consider the bilinear form 

B{ai,a2) = Tr(aia2) 

on Der(g). This is nondegenerate, since Der(g) is preserved by transposition 
and B{a,a^) = |ap. We have 

B([a,/?],7) + S(/3, [a,7])=0 

for all a,/3, 7 G Der(g). Thus the subspace 

gperp = g Derg : B{a, ad^) = for all ^ e g} 

is another ideal in Der(g). On the other hand the map a —a^ is an auto- 
morphism of Der(g), so g-^ is also an ideal in Der(g). Suppose that gflg-^ ^ 0. 
Then we can find a non-zero element a of g fl g^ with = ±a and then 
B{a,a) = ±|q!|^ ^ 0, so the restriction of i? to g is not identically zero. This 
means that / = g n gP'^'P is not the whole of g, but / is an ideal in g so, since 
g is simple, we must have 7 = 0. 

We conclude from the above that if g were a proper ideal in Der(g) there 
would be another proper ideal J in Der(g) such that J n g = 0. (We take J to 
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be either or gP®''P.) But then for a G J we have [a, g] = 0, but this means 
that a acts trivially on g, which gives a contradiction. 

Finally, the statement about compact subgroups in Theorem 1 follows im- 
mediately from that in Theorem 2. 

(The argument corresponding to the above in the complex case (see (5.1)) 
is more transparent. ) 

3 Lengths of vectors 

We will now begin the proof of Theorem 2. The idea is to find a metric by 
minimising the associated norm of the vector w. In the Lie algebra situation, 
which we are primarily concerned with here, this is in essence the approach 
suggested by Cartan and carried through by Richardson. In the general situation 
considered in Theorem 2 the ideas have been studied and applied extensively 
over the last quarter century or so, following the work of Kempf-Ness |8], Ness 
[12] and Kirwan [S]. Most of the literature is cast in the setting of complex 
representations. The real case has been studied by Richardson and Slodowy 
[13] and Marian [TT] and works in just the same way. 

Recall that we have a representation p of SL{V) in SL{W), where V and W 
are real vector spaces, a fixed vector w €W and we define to the stabiliser 
of w in SL{V). Suppose we also have some compact subgroup (which could be 
trivial) Kq C G^u. We fix any Euclidean metric | |i on 1/ which is preserved by 
Kq. Now it is standard that we can choose a Euclidean metric | |vf on which 
is invariant under the restriction of p to SO{V). We want to choose this metric 
I \\Y with the further property that the derivative dp intertwines transposition 
in EndX^ (defined by | |i) and transposition in Endll^ (defined by | \w)', that is 
to say 

To see that this is possible we can argue as follows. We complexify the rep- 
resentation to get PC ■ SL{V (g) C) — > SL{W C). Then the compact group 
generated by the action of SU{V ® C) and complex conjugation acts onW 
and we can choose a Hermitian metric on ly ® C whose norm function is invari- 
ant under this group. Invariance under complex conjugation means that this 
Hermitian metric is induced from a Euclidean metric on W. Then the fact that 
pc maps SU{V (E> C) to SU{W ® C) implies that dp has the property desired. 
(The author is grateful to Professors He and Zhang for pointing out the need 
for this argument. In our main application, to Theorem 1, the standard metric 
on W already has the desired property.) 
Now define a function F on SL{V) by 

%) = l5Hlw- 
For u e SO{V) and 7 £ we have 
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So F induces a function F on the quotient space Ti. = SL{V)/SO{V), invariant 
under the natural action of Gyj C SL{V). We can think about this in another, 
equivalent, way. We identify H with the Euclidean metrics on of a fixed 
determinant. Since p : SL{V) SL{W) maps SO{V) to SO{W) it induces 
a map from SL{V)/SO{V) to SL{W)/SO{W) and so a metric on V with the 
same determinant as | |i induces a metric on W. Then F is given by the 
square of the induced norm of the fixed vector w. Explicitly, the identification 
of SL(y)/ SO(y) with metrics is given by [5] | |g where 

= \9v\l = {v,g'^gv)i. 

This function F has two crucial, and well-known, properties, which we state 

in the following Lemmas 

Lemma 1 Suppose F has a critical point at H G Ti. Then the Lie algebra of 
the stabiliser is symmetric with respect to the Euclidean structure HonV. 

To prove this, there is no loss in supposing that H is the original metric | |i. 
(For we can replace w by gw for any g G SL{V).) The fact that p maps SO{V) 
to SO{W) implies that its derivative takes transposition in End^ (defined by 
I |i) to transposition in EndW^ defined by | \w The condition for V to be 
stationary is that 

{dp{^)w,w)w = 

for all ^ in the Lie algebra of SL{V). In particular consider elements of the form 
^ = [t], V^] and write A = dp{r]). Then we have 

= {dp[r], j]^]w, w)w = {[A, A^]w, w)w = \A^w\ly - \Aw\ly. 

By definition 77 lies in the Lie algebra of Gyj if and only if Aw = 0. By the 
identity above, this occurs if and only if A^w = 0, which is just when r]-^ lies in 
the Lie algebra of Gw 

For the second property of the function we need to recall the standard notion 
of geodesies in H. We can identiiy H with the positive definite symmetric 
elements of SL{V), with the quotient map SL{V) — > H given hy g g^g. 
Then the geodesies in H are paths of the form 

7(t) =5^exp(5%, (1) 

where g and S are fixed, with g e SLiV) and S a trace-free endomorphism 
which is symmetric with respect to | |i. Another way of expressing this is that 
a geodesies through any point H & H is the orbit of H under a 1-parameter 
subgroup e{t) in SL{V) where e{t) = exp(at) with a a symmetric endomorphism 
with respect to the metric H. 

Lemma 2 1. For any geodesic 7 the function F o 7 is convex i.e. 



2. If F achieves its minimum in H then acts transitively on the set of 
minima. 

To prove the first part, note that, replacing w by gw, we can reduce to 
considering a geodesic through the base point [1] G H, so of the form exp(5't) 
where S is symmetric with respect to | |i. Now the derivative dp maps the 
symmetric endomorphism 5 to a symmetric endomorphism A e End(VK). We 
can choose an orthonormal basis in W so that A is diagonal, with eigenvalues 
Aj say. Then if w has coordinates Wi in this basis we have 

F{exp{St)) = F{exp{St/2)) = ^ | exp(A,t/2)«;,|2, = ^ Iwi]"" exp(Ait), 

and this is obviously a convex function of t. 

To prove the second part note that, in the above, the function F{exp{St) is 
either strictly convex or constant, and the latter only occurs when A^ = for 
each index i such that Wi ^ 0, which is the same as saying that exp{St)w = w 
for all t, or that the 1-parameter subgroup exp{St) lies in G^,. More generally 
if we write a geodesic through a point H as the orbit of H under a 1-parameter 
subgroup e{t) in SL{V) then the function is constant if and only if the 1- 
parameter subgroup lies in Gw Suppose that H\,H2 are two points in H where 
F is minimal. Then F must be constant on the geodesic between Hi, H2. Thus 
H2 lies in the orbit of Hi under a 1-parameter subgroup in Gw So G^, acts 
transitively on the set of minima. 

We now turn back to the proof of Theorem 2. Suppose that the convex 
function F on H achieves a minimum at Hi e Ti. Then by Lemma 1 the Lie 
algebra of G^, is symmetric with respect to the Euclidean structure Hi on V . 
It only remains to see that the compact subgroup Kq of G^ is conjugate to a 
subgroup of the orthogonal group for this Euclidean structure. For each H gH 
we have a corresponding special orthogonal group SO{H,V) C SL{V). For 
g e SL{V) the groups SO{H,V), SO{g{H),V) are conjugate by g in SL{V). 
Recall that we chose the metric | |i to be ii'o invariant. This means that Kq fixes 
the base point [1] in Ti.. Suppose we can find a point Hq in Ti. which minimises 
F and which is also i^o-invariant. Then Kq is contained in SO{Hq, V). But by 
the second part of Lemma 2 there is a 7 e G^, such that 7(-ffo) = Hi. Thus 
conjugation by 7 takes SO{Ho,V) to SO{Hi,V) and takes Kq to a subgroup 
on SO{Hi, V), as required. 

To sum up. Theorem 2 will be proved if we can establish the following result. 

Theorem 3 Let F be a convex function on Ti, invariant under a group G^ C 
SL(V). Let Kq be a compact subgroup of Gw and let [1] e TL be fixed by Kq. 
Then if V is an ireducible representation of Gw there is a point Hq €H where 
F achieves its minimum and which is fixed by Kq . 

(Notice that the hypothesis here that there is a point [1] S H fixed by Kq is 
actually redundant, since any compact subgroup of SL{V) fixes some metric.) 
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4 Riemannian geometry argument 



In this section we will see that Theorem 3 is a particular case of a more general 
result in Riemannian geometry. Let M be a complete Riemannian manifold, so 
for each point p e M we have a surjective exponential map 

expp : TMp M. 

We suppose M has the following property 

Property (*) 

For each point p in M the exponential map exp^ is distance-increasing 

Readers with some background in Riemannian geometry will know that it 

is equivalent to say that M is simply connected with nonpositive sectional cur- 
vature, but we do not need to assume knowledge of these matters. The crucial 
background we need to know is 



Fact 

There is a metric onH ~ SL{V)/SO{V) for which the action of SL{V) is 
isometric, with the geodesies described in (1) above and having Property (*). 

This Riemannian metric on Ti can be given by the formula 

\\5H\\l = T,{5HH-')\ 

The distance- increasing property can be deduced from the fact that H. has non- 
positive curvature and standard comparison results for Jacobi fields. For com- 
pleteness, we give a self-contained proof of the Fact in the Appendix. 

The piece of theory we need to recall in order to state our Theorem is the 

notion of the "sphere at infinity" associated to a manifold M with Property (*). 
This will be familiar in the prototype cases of Euclidean space and hyperbolic 
space. In general, for a; G M write for the unit sphere in the tangent space 
TM^ and define 

: M \ {x] ^ 5, 

by 



|expa;^(2;)| 

If y is another point in M and R is greater than the distance d = d{x, y) we 
define 

by 

FR,x,y{v) = Qxexpy{Ri^). 
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Lemma 3 For fixed x, y, v the norm of the derivative of Fii^x,v,w with respect 
to R is hounded by 

< ^(^_^)- 

Let 7 be the geodesic 7(t) ~ cxpy{ti'), let w be the point jiR) and let cr be 
the geodesic from x to w. The distance-increasing property of exp^ implies that 
the norm of the derivative appearing in the statement is bounded by d{x,'w)~^ 
times the component of 'j'{R) orthogonal to the tangent vector of a at w. Thus 

.9 sin(/) 

where cj) is the angle between the geodesies 7, cr at w. By the triangle inequality 
d{x, w) > R — d. In a Euclidean triangle with side lengths d, R the angle 
opposite to the side of length d is at most sm~^ (d/R). It follows from the 
distance-increasing property of exp^ that sin <p < d/R. Thus 

sin 'P ^ d 
d{x,w) - R{R - d) ' 

as required. 

Since the integral of the function l/R{R — d), with respect to R, from R = 2d 
(say) to i? = c», is finite, it follows from the Lemma that FR,x,y converges 
uniformly as i? ^ 00 to a continuous map F^.y '■ Sy Sx, and obviously F^.x 
is the identity. Let z be another point in M and u he a unit tangent vector at 
z. Then we have an identity, which follows immediately from the definitions, 

Fr,x,z{v) = FB,\x,y ° FR^y^z{v), 

where R' = d{y,exp^{Ri')). Since, by the triangle inequality again, 

R' >R-d{y,z), 
we can take the limit as ^ 00 to obtain 

Fx^z — Fx,y Q Fy,z '• Sz ^ Sx- 

In particular, Fy^^ is inverse to Fx^y so the maps F^^y give a compatible family of 
homeomorphisms between spheres in the tangent spaces. We define the sphere 
at infinity Soc{M) to be the quotient of the unit sphere bundle of M by these 
homeomorphisms, with the topology induced by the identification with Sxq for 
any fixed base point xo- 

Now suppose that a topological group F acts by isometries on M. Then F 
acts on Soo{M), as a set. Explicitly, if we fix a base point xq and identify the 
sphere at infinity with Sx^,, the action of a group element g( € F is given by 

g{u) = lim e^(,c/(exp^^i?z^). 
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Write the action as 

^ : r X Sx„ > Sxq ■ 
Given a compact set P C F we can define 

for sufficiently large R, by 

Since g{exp^^ Rv) = exp^j^^o) {Rg*^) the maps Ar converge uniformly as i? — *■ oo 
to the restriction of A to P x S^^- It follows that the action A is continuous. 
With these preliminaries in place we can state our main technical result. 

Theorem 4 Suppose that the Riemannian m,anifold M has Property (*). Sup- 
pose that r acts by isometrics on M and F is a convex T -invariant function 
on M. Then either there is a fixed point for the action ofV on Soc{M) or the 
function F attains its minimum in M. Moreover, in the second case, if Kq is a 
subgroup of T which fixes a point x G M , then there is a point x' G M where F 
attains its minimum in M and with x' fixed by Kq. 

Return now to our example H. The tangent space at the identity matrix [1] 
is the set of trace-free symmetric matrices. We define a weighted flag [J^, /j.) to 
be a strictly increasing sequence of vector subspaces 

= FoCFiCF2...cFr. = V 

with associated weights /^i > /U2 • • • > A'd subject to the conditions 

^Uii^i = 0,^nii4 = 1, 

where Ui = dimFi/ Fi^i. If S* is a trace- free symmetric endomorphism with 
Tr 5^ = 1 then we associate a weighted flag to S as follows. We take /x^ to be 
the eigenvalues of S, with eigenspaces Ei, and form a flag with 

Fi = El , F2 ^ El (B E2 , . . . . 

It is clear then that the unit sphere S'fij in the tangent space of H at [1] can be 
identified with the set of all weighted flags. Now there is an obvious action of 
SL{V) on the set of weighted fiags and we have: 

Lemma 4 The action of SL{V) on the sphere at infinity in H coincides with 
the obvious action under the identifications above. 

This is clearly true for the subgroup SO{V). We use the fact that given any 
weighted flag {J-,n) and g S SL{V) we can write g — uh where u £ SO{V) 
and h preserves J^. (This is a consequence of the obvious fact that SO{V) acts 
transitively on the set of flags of a given type.) Thus it suffices to show that such 
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h fix the point S in the unit sphere corresponding to (jF, fi) in the differential- 
geometric action. By the SO{V) invariance of the set-up we can choose a basis 
so that is the standard flag 

c R"i c R"i e R"^ . . . c R". 

Then S is the diagonal matrix with diagonal entries m,. . . ,^r, repeated accord- 
ing to the multiplicities ni, . . . , n^. The matrix h is upper triangular in blocks 
with respect to the flag. Now consider, for a large real parameter R the matrix 

Mr = exp{ — —)hexp{—). 
Consider a block hij of h. The corresponding block of Mr is 

Since h is upper-triangular in blocks and the Hi are increasing, we see that Mr 
has a limit as R tends to infinity, given by the diagonal blocks in h. Since these 
diagonal blocks are invertible the limit of M{R) is invertible, hence 

6R = Tr{log{MRM*R)f 

1/2 

is a bounded function of R. But 5j{ is the distance in Hn between exp(i?5) 

and /i cxp (i?S') ft. It follows from the comparison argument, as before, that the 
angle between Q^^{hex.p{RS)h^) and S tends to zero as i? — >■ oo, hence h fixes 
S in the differential geometric action. 

Now Theorem 3 is an immediate consequence of Theorem 4 and Lemma 4, 
since if fixes a point on the sphere at infinity in H it fixes a flag, hence some 
non- trivial subspace of V, and V is reducible as a representation of Gyj. 
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Remarks 

• The advantage of this approach is that Theorem 4 seems quite accessible 
to geometric intuition. For example it is obviously true in the case when 
M is hyperbolic space, taking the ball model, and we suppose that F 
extends continuously to the boundary of the ball. For then F attains its 
minimum on the closed ball and if there are no minimising points in the 
interior the minimiscr on the boundary must be unique (since there is a 
geodesic asymptotic to any two given points in the boundary). 

• The author has not found Theorem 4 in the literature, but it does not seem 
likely that it is new. There are very similar results in yy for example. The 
author has been told by Martin Bridson that a more general result of this 
nature holds, in the context of proper CAT(O) spaces. The proof of this 
more general result follows in an obvious way from Lemma 8.26 of [5j (see 
also Corollary 8.20 in that reference). 

• The hypothesis on the existence of a fixed point x for Kq in the statement 
of Theorem 4 is redundant, since any compact group acting on a manifold 
with Property (*) has a fixed point, by a theorem of Cartan (see the 
remarks at the end of Section 3 above, and at the end of (5.2) below). 
However we do not need to use this. 

We now prove Theorem 4. We begin by disposing of the statement involving 
the compact group Kq- Suppose that F attains its minimum somewhere in M . 
Then, by convexity, the minimum set is a totally geodesic submanifold S C M. 
The action of Kq preserves E, since F is F-invariant and Kq is contained in F. 
Let x' be a point in S which minimises the distance to the Kq fixed-point x. 
Then if x" is any other point in S the geodesic segment from x' to x" lies in E 
and is orthogonal to the geodesic from x to x' at x' . By the distance-increasing 
property of the exponential map at x' it follows that the distance from x to x" 
is strictly greater than the distance from x to x' . Thus the distance- minimising 
point x' is unique, hence fixed by Kq. 

To prove the main statement in Theorem 4 we use the following Lemma. 

Lemma 5 Suppose that M has Property (*) and N is any set of isometries of 
M . If there is a sequence Xi in M with d{xo, Xi) oo and for each g £ N there 
is a Cg with d{xi,gxi) < Cg for all i, then there is a point in Soo{M) fixed by 
N. 

Set Ri = d{xo,Xi) and Vi = Qxai^i) S Sxq- By the compactness of this 
sphere we may suppose, after perhaps taking a subsequence, that the z/^ converge 
as i tends to infinity to some i> 6 S^o- Then for each g £ N we have, from the 
definitions, 

ARi{9,i^i) = ^xo{gXi). 
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Fixing g, let (pi be the angle between the unit tangent vectors Vi = Oxoi^i) S'Hd 
Qxoid^i)- The distance increasing property implies, as in Lemma 3, that 

sin (pi < Cg/Ri. 

The angle (pi can be regarded as the distance dist( , ) between the points and 
Aii.{g,i/i) in the sphere Sxq- In other words we have 

C 

dist {{ui,Aii,{g,Ui)) < sin'^^). 

Hi 

Now take the limit as z ^ cxd: we see that dist {v, A{g, v)) =0, which is to say 
that u is fixed by g. 

To prove Theorem 4, consider the gradient vector field grad F of the function 
F, and the associated flow 

— = -grad Fx 
at 

on M. By the standard theory, given any initial point there is a solution x{t) 
defined for some time interval (— T, T). 

Lemma 6 If x{t) and y{t) are two solutions of the gradient flow equation, for 
t G {—T,T), then d{x{t),y{t)) is a non-increasing function oft. 

If x(t) and y{t) coincide for some t then they must do so for all t, by uniqueness 
to the solution of the flow equation, and in that case the result is certainly true. 
If x{t) and y{t) are always different then the function D{t) = d{x{t),y{t)) is 
smooth: we compute the derivative at some fixed to. Let 7(5) be the geodesic 
from x{to) = 7(0) to y{to) = -f{D). Clearly 

D\to) = {gvadFx^t),j'{0)) - (gradF^(t),7'(^)). 

But (gradi^^(j5'), 7'(s)) is the derivative of the function ^"07(5), which is nonde- 
creasing in s by the convexity hypothesis, so D'{to) > 0, as required. 

A first consequence of this Lemma — applied to y{t) = x{t+6) and taking the 
limit as 5 ^ — is that the velocity |^|of a gradient path is decreasing. Thus 
for finite positive time x{t) stays in an a priori determined compact subset of M 
(since this manifold is complete). It follows that the flow is actually defined for 
all positive time, for any initial condition. Consider an arbitrary initial point 2:0 
and let x{t) be this gradient path, for t >0. If there is a sequence ii ^ 00 such 
that x{ti) is bounded then, taking a subsequence, we can suppose that x{ti) 
converges and it follows in a standard way that the limit is a minimiim of F. 
If there is no such minimum then we can take a sequence such that Xi — x{ti) 
tends to infinity. Suppose that g is in F, so the action of 3 on M preserves F 
and the metric. Then y{t) = g{x{t)) is a gradient path with initial value g{xo) 
and d{xi,gxi) < Cg = d{xo,gxo). Then, by Lemma 5, there is a fixed point for 
the action of F on S'oo(M). 
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There is an alternative argument which is perhaps more elementary, although 
takes more space to write down in detail. With a fixed base point xo choose c 
with infjvf F < c < F{xq) and let Sc be the hypersurface F~^{c). Let Zc S Sc 
be a point which minimises the distance to xq, so that xq lies on a geodesic 7 
from Zc normal to Sc. The convexity of F implies that the second fundamental 
form of Sc at Zc is positive with respect to the normal given by the geodesic 
from Zc to Xq. a standard comparison argument in "Fermi coordinates" shows 
that the exponential map on the normal bundle of is distance increasing on 
the side towards xq- In particular, let w be another point in Sc and y = exp(i?^) 
where ^ is the unit normal to E,; at w pointing in the direction of increasing _F 
and R = d{xo,Zc). Then we have d{zc,w) < d{xQ,y). Now suppose g is in F. 
Then g preserves Sc and if we take w = g{zc) above we have y = g{xo)- So we 
conclude from this comparison argument that d{zc, g{zc)) < d(xQ, gxa). Now 
take a sequence Cj decreasing to inf F (which could be finite or infinite). We get 
a sequence Xi = z^ of points in M. If (xi) contains a bounded subsequence then 
we readily deduce that there is a minimum of F. If Xi tends to infinity we get 
a sequence to which we can apply Lemma 5, since d{xi,gxi) < C'g = d{xo,gxo). 

5 Discussion 

5.1 Consequences of Theorem 1 

• We start with a simple Lie algebra g and use Theorem 1 to obtain an 
embedding g C End(F), for a Euclidean space V, with g preserved by 
the transposition map. We also have a corresponding Lie group G C 
SL{V). We write K for the identity component of G n SO{V). It follows 
immediately from Theorem 1 that K is a. maximal compact connected 
subgroup of G, and any maximal compact connected subgroup is conjugate 
to K. 

• The involution a 1-^ —a^ on End(V) induces a Cartan involution of g so 

we have an eigenspace decomposition 

g = k©p 

with k = Lie{K) and 

[k,k] Ck, [k,p] CP, [p,p] Ck. (2) 
Notice that k is non-trivial, for otherwise g would be abelian. 

• Consider the bilinear form B{a, 0) = Tr{af3) on g. Clearly this is positive 
definite on p, negative definite on k and the two spaces are _B-orthogonal. 
Thus B is nondegenerate. The Killing form i? of g is negative-definite on 
k (since the restriction of the adjoint action to K preserves some metric 
and k is not an ideal). So the Killing form is not identically zero and must 
be a positive multiple of B (otherwise the relative eigenspaces would be 
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proper ideals). In fact we do not really need this step, since in our proof 
of Theorem 1 the vector space is g itself, and B is trivially equal to 
the Killing form. 

• Either p is trivial, in which case G is itself compact, or there is a nontrivial 
Riemannian symmetric space of negative type M~ = G/K associated 
to g. This can be described rather explicitly. Let us now fix on the 
specific representation g c End(g) used in the proof of Theorem 1. Say a 
Euclidean metric on g is "optimal" if the adjoint embedding is symmetric 
with respect to the metric, as in Theorem 1. (It is easy to see that the 
optimal metrics are exactly those which minimise the norm of the bracket, 
among all metrics of a given determinant.) Then can be identified 
with the set of optimal metrics, a totally geodesic submanifold oi Ti = 
SLig)/SOig). 

• So far we have worked exclusively in the real setting. We will now see 
how to derive the existence of compact real forms of a simple complex Lie 
algebra. 

Lemma 7 If g is a simple complex Lie algebra then it is also simple when 
regarded as a real Lie algebra. 

To see this, suppose that A c g is a proper real ideal: a real vector 
subspace with [A, g] C A. By complex linearity of the bracket A O iA is 
a complex ideal, so we must have AlliA = 0. But then since ig = g we 
have [A,g] = [A,ig] = i[A,g\ c iA, so [A,g] = 0. But A + iA is another 
complex ideal, so we must have g = iA® A and g is Abelian. 

Next we have 

Lemma 8 Let g be a simple complex Lie algebra and let g = Lie(G) C 

EndV he an embedding provided by Theorem 1. regarding g as a real Lie 
algebra. Then g is the complexification of the Lie algebra of the compact 
group K = Gr)SO{V). 

The inclusions (2) imply that 

/= (pnik) + (knip), 

is a complex ideal in g, so either / = g or J = 0. In the first case we 
have ik = p and g is the complexification of k, as required. So we have 
to rule out the second case. If this were to hold we have k fl ip = so 
g = k © (ip) . Then [ip, ip] C k so the map a on g given by multiplication 
by 1 on k and by —1 on ip is another involution of g, regarded as a real 
Lie algebra. Now let Be be the Killing form regarded as a complex Lie 
algebra. So _B = 2Rc_Bc. The fact that a is an involution of g means 
that B{k,ip) = 0. But we know that p is the orthogonal complement of 
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k with respect to B and B, so we must have ip — p. But B is positive 
definite on p while B{ia, ia) = 2ReBc{ia, ia) — —B{a, a) so p fl ip = 0. 
This means that p = and g = k which is clearly impossible (by the same 
argument with the Killing form). 

• The argument above probably obscures the picture. If one is interested 
in the complex situation it is much clearer to redo the whole proof in 
this setting, working with Hcrmitian metrics on complex representation 
spaces. The proof goes through essentially word-for-word, using the fact 
that the standard metric on SL{n, C)/SU{n) has Property (*).Then one 
can deduce the real case from the complex case rather than the other way 
around, as we have done above. 

• Returning to the case of a simple real Lie algebra g, which is not the Lie 
algebra of a compact group, we can also give an explicit description of the 
symmetric space of positive type dual to M~ . Fix an optimal metric 
on g and extend it to a Hermitian metric on g (g) C. Then Af + is the 
set of real forms g' C g ® C which are conjugate by G"^ to g and such that 
the restriction of Re H to g' is an optimal metric on g'. This is a totally 
geodesic submanifold of SU{g (g) C)/S'0(g). 

5.2 Comparison with other approaches 

The approach we have used, minimising the norm of the Lie bracket, is essen- 
tially the same as that suggested by Cartan, and carried through by Richardson, 
with the difference that we do not assume known that the Killing form is non- 
degenerate so we operate with a special linear group rather than an orthogonal 
group. The crucial problem is to show that the minimum is attained when the 
Lie algebra is simple. This can be attacked by considering points in the closure 
of the relevant orbit for the action on the projectivized space. Richardson gives 
two different arguments. One uses the fact that a semisimple Lie algebra is 
rigid with respect to small deformations; the other uses the fact that a semisim- 
ple Lie algebra is its own algebra of derivations, so the orbits in the variety of 
semisimple Lie algebras all have the same dimension. 

There is a general procedure for testing when an orbit contains a minimal 
vector, using Hilbert's 1-parameter subgroup criterion for stability in the sense 
of Geometric Invariant Theory [9]. In the Lie algebra situation this gives a 
criterion involving the nonexistence of filtrations of a certain kind, but the 
author does not know an easy argument to show that simple Lie algebras do 
not have such filtrations. However it is also a general fact that, in the unstable 
case, there is a preferred maximally destabilising 1-parameter subgroup. This 
theory was developed by Kempf [7] and Hesselink [5] in the algebraic setting, 
and in connection with the moment map and the length function by Kirwan [6] 
and Ness ^2\- The argument we give in Section 4 is essentially a translation 
of this theory into a differential geometric setting. Lauret [TO] has studied the 
application of this general circle of ideas (Geometric Invariant Theory/Moment 
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maps/minimal vectors) to more sophisticated questions in Lie algebra theory — 
going beyond the case of simple algebras. 

One advantage of this method, in the real case, is that the uniqueness of 
maximal compact subgroups up to conjugacy emerges as part of the package. 
In the usual approach ([i]. Theorem 13.5) this is deduced from a separate argu- 
ment: Cartan's fixed point theorem for spaces of negative curvature. We avoid 
this, although the techniques we apply in Section 4 are very similar in spirit. 

6 Appendix 

We give a simple proof of the well-known fact stated in Section 4: that the 
manifold Ti. has Property (*). We identify Ti with n x n positive definite sym- 
metric matrices of determinant 1. It suffices to prove the statement for the 
exponential map at the identity matrix. Recall that the metric on H is given by 
|i57f||j = Tr (^{SH)H~^) . For fixed symmetric matrices S,a and a small real 
parameter h define 

H{h) = (exp(S' -I- ha) - exp(S')) exp(-S'), 

and 

dH, 
' = ^^'="- 
we need to show that, for any S and a, we have 

Tr v'^ > Tr a^. 
To see this we introduce another real parameter t and set 

Hit, h) = (exp(t(S' -t- ha)) - exp(t5)) exp(-tS'). 
Then one readily computes 
dH 

— = [S, H] + ha exp{t{S + ha)) exp{-tS). 
at 

Now differentiate with respect to h and evaluate aX h — Q to get a matrix valued 
function V{t). Then we have 

Clearly v — V{1) and V^(0) — 0, so our result follows from the following 

Lemma 9 Let S, a be real, symmetric nxn matrices and let V(t) be the matrix 
valued function which is the solution of the ODE 

with V{0) = 0. Then 

Tr y(t)2 > i^Tr a^ 

for all t. 
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To see this, consider first a scalar equation 



with A, a constants and with the initial condition /"""(O) = 0. The solution is 

fHt) ' - 



A 

where we understand the expression in brackets is to be interpreted as t in the 
case when A = 0. Let satisfy the similar equation 

with /-(O) = 0. Then 
where 

_ (e^*-l)(l-e-^*) _ 2(cosh(A0 - 1) 

It is elementary that Q{t) > 1, so f^{t)f^{t) > t^a?. 

Now consider the operator ads acting on n x n matrices. We can suppose 
S is diagonal with eigenvalues \i. Then a basis of eigenvectors for ads is given 
by the standard elementary matrices Eij and 

ads(i?y ) = XijEij, 

where Ay = A^ — \j. Thus the matrix equation reduces to a collection of scalar 
equations for the components Vij{t). Since Xji = —Xij and aij = aji, each pair 
Vij, Vji satisfy the conditions considered for f^,f~ above and we have 

(This is also true, with equality, when i — j). Now summing over i,j gives the 
result. 

This proof is not very different from the usual discussion of the Jacobi equa- 
tion in a symmetric space. It is also much the same as the proof of Helgason's 
formula for the derivative of the exponential map ([1], Theorem 1.7). 
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